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IDENTIFYING PEPTIDE LIGANDS OF TARGET PROTEINS WITH TARGET 
COMPLEMENTARY LIBRARY TECHNOLOGY (TCLT) 
FIELD OF THE INVENTION 
This invention relates to methods of screening for peptide ligands of target 

5 proteins. 

BACKGROUND OF THE INVENTION 
Rational drug designs are more and more dependent on identifying binding 
ligands which interacts with a target protein and the ability to predict their points of contact. 
Peptide library technology has become a very powerful tool in this regard. Large 
10 collections of peptides with randomized sequences are prepared and displayed on the 
surface of genetic packages, i.e., replicable entities such as cells or viruses, especially 
phage. Peptides having specific binding properties are subsequently selected by multiple 
^ rounds of panning (selection by binding affinity to an immobilized target) and amplification 

of the bound genetic package, such as phage (!). Phage display peptide~iibraries have been 
!Ij 1 5 successfully screened to identify peptides that bind to cell surface or intracellular receptors, 

s g substrates or inhibitors of enzymes, and epitopes recognized by polyclonal and monoclonal 

O antibodies (2-10). 

SUMMARY OF THE INVRNTTON 
!L. This invention features an improved method of screening for peptide ligands 

jy 20 of target proteins. This method is more powerful than the random peptide library 

S technology in that it utilizes a predefined pool of peptides containing ligands for a selected 

q tar S et protein. The peptides in the pool exhibit complementary hydropathy to the target 

I 3 * protein. Peptide ligands are selected based on their ability to bind to the target protein. 

This method is hereinafter referred to as Target Complementary Library Technology 
25 (TCLT). 

TCLT can be used in all occasions when random peptide libraries are used to 
identify ligands for target proteins. By identifying peptide ligands which can influence the 
biological activities of their target proteins, TCLT can be used to discover peptides of 
therapeutic utility. In addition, peptide ligands are useful for studying the dynamic structure 
30 of their target proteins, and such information can be used for designing functional 
pharmacological small molecules. 

Applications of TCLT include, but are not limited to, the following: 
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1) Identifying peptides which functionally mimic idiotypic antibodies. By 

mimicking the anti-idiotypic antibodies of the beta type (Ab2beta) (62), a peptide ligand 
can block the antigen-binding site of an antigen-specific antibody. By mimicking the anti- 
idiotypic antibodies of the gamma type ( Ab2gamma) (62), a peptide ligand can prevent 
antigen-binding of an entire class of antibodies, such as IgG or IgE. Specifically, the 
framework 2 and framework 3 (FR2 and FR3) segments of both the heavy chain and light 
chain of a immunoglobulin molecular can serve as the target sequence for binding peptides 
which mimic the function of an Ab2gamma antibody. Each of the complementarity - 
determining - regions (CDR1, CDR2 and CDR3) can serve as the target sequence for AB2 
beta-mimicking peptides. The target proteins include IgE, IgG, IgA, and specific antibodies 
such as anti-acetylcholine receptor antibodies. Peptide ligands of these target proteins can 
be used for the prevention and treatment of allergy, and autoimmune diseases including 
rheumatoid arthritis, myasthenia gravis, systemic lupus erythematosis, and autoimmune 
nephritis. 

2) Identifying peptides which mimic the function of a biological active 
molecule, such as a hormone, a cytokine, a growth factor, or a chemical compound. The 
target proteins include the cell surface receptors and intracellular receptors for these 
molecules, including, but not limited to, the receptors for insulin, human growth hormone, 
erythropoietin, thrombopoietin, estrogen, androgen, epidermal growth factor, interferons, 
interleukins, fibrinogen, endorphin, enkephalin, dopamine, nicotine, acetylcholine, 
angiotensin, and somatostatin. 

3) Identifying peptides which mimic the binding domain of a receptor. By 
binding to the receptor's native ligand, the peptide ligands act as decoys to compete with 
the receptor, thereby preventing the biological effects induced by the interaction between 
the receptor and its native ligand. The target proteins in this case include the biological 
active peptides and proteins, which include, but are not limited to, interleukin-1, 
interleukin-2, bradykinin, platelet activating factors, and tumor necrosis factor (TNF). 

4 ) Identifying peptides which mimic the receptor-binding domains of virai or 
bacterial proteins. Such peptides can block viral or bacterial infections by preventing the 
virus or bacteria from binding to its cellular receptors. In this case the target proteins can 
either be the known cellular receptors for virusestoacteria, such as ICAM-1 and LDL 
receptor which are the major and minor receptors for human rhinoviruses (63, 64), or a viral 
or bacterial surface protein known to bind to corresponding cellular receptors. 
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5) Identifying peptide ligands for any protein. The peptide Hgand can either be 
used for affinity purification of the target protein, or as a probe for the detection of the 
target protein in vitro and in vivo. 

6) Identifying an unknown receptor for a known ligand. and an unknown ligand 
5 for a known receptor. In this case, the known ligand/receptor would serve as the target and 

the corresponding receptor/1 igand can be identified by one of the following approaches. 
First, the complementary DNA fragment selected by TCLT can be used as a probe to screen 
another library which contains the potential receptor (23). Second, the complementary anti- 
sense peptide encoded by the complementary DNA fragment can serve as an immunogen to 
10 generate antibodies which interact with the potential receptor with specificity (24-37). 

TCLT involves constructing (i) a peptide library containing a collection of 
complementary peptides of a target protein/peptide, wherein the complementary peptides in 
.~ aggregation complement all, substantially all (preferably no less than 70%, more preferable 

tD no less than 80%, even more preferably no less than 90%), or a significant portion of the 

m 

f j 15 target protein/peptide; or (ii) a nucleic acid library containing a collection of nucleic acids 

: p encoding the aforesaid complementary peptides. In the latter case, the nucleic acid library 

Q 

is inserted into expression vectors and introduced into genetic packages such as phage 
<|F particles and the complementary peptides are expressed on the surface of the genetic 

i~ packages. Peptide ligands are selected from the library by panning against the target 

IU 20 protein/peptide or the binding domain of the target protein/peptide. The target 

& protein/peptide or its binding domain is preferably immobilized on a solid support. Genetic 

□ packages which bind to the immobilized target protein/peptide can be amplified and subject 

! "~ to several more rounds of panning. Complementary peptides with sufficient specific 

affinity to the target protein/peptide are selected at the end of this process, and their 
25 sequences can be uncovered from the genome of the selected genetic packages. 

By "hydropathy character of an amino acid" is meant the hydrophobic or 
hydrophilic nature of the amino acid. Specifically, it is reflected by the hydropathy score of 
the amino acid in Tables 1-3 of the specification. The hydropathy scores of the twenty 
amino acids in Table 1-3 range from +4.5 to -4.5, and allow for the grouping of the twenty 
30 amino acids as hydrophobic (+4.5 to +1.8, Table 1), hydrophilic (-4.5 to -3.2, Table 2), and 
slightly hydrophilic (-0.4 to -1.6, Table 3) (Ref. 1 1). Hydrophobic amino acids in Table 1 
(i.e., He, Val, Leu, Phe, Cys, Met, and Ala) complement hydrophilic amino acids in Table 2 
(i.e., Arg, Lys, Asn, Asp, Gin, Glu, and His), and vice versa. Slightly hydrophilic amino 
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acids in Table 3 (i.e., Gly t Thr, Trp, Ser, Tyr, and Pro) complement each other. For 
example, each of Arg, Lys, Asn t Asp, Gin, Glu, and His is complementary to He; and each 
of Gly, Thr, Trp, Ser, Tyr, and Pro is complementary to Gly. In other words, each of Arg, 
Lys, Asn, Asp, Gin, Glu, and His is a complementary amino acid to lie; and each of Gly, 
5 Thr, Tip, Ser, Tyr, and Pro is a complementary amino acid to Gly. 

By "hydropathy profile of a peptide or protein" is meant its profile as defined 
by the hydropathic characters of amino acids in the peptide or protein. 

By "complementary peptide" is meant a peptide containing a contiguous 
amino acid sequence (in the direction from the amino terminus to the carboxy terminus or 
10 in the direction from the carboxy terminus to the amino terminus) which complements the 
amino acid sequence of the target protein/peptide according to the rule set forth above. Any 
target protein/peptide has a plethora of complementary peptides because each amino acid in 
the target protein/peptide has a number of complementary amino acids. 




^By "imti-sunse peptide" is meant a peptide encoded by the anti-serise Strand 
of the target gene translated in either 5' to 3* or 3* to 5* directions. For exan^pterinhesense 
strand of the target gene has 5' CUU GUU CUU UUU Tencedtn^apeptide Leu-Val-Leu- 
Phe, the anti-sense peptide would either be apeptiSehaving the sequence Lys-Lys-Asn-Lys 
as encoded by the anti-sense strand^XAA AAG AAC AAG 3' or a peptide having the 
sequence Glu-Gln-Glu^Lyfas encoded by 5'GAA CAA GAA AAA 3*. The matching 
20 pattern in TaJ>tejfl-3 shows that the anti-sense strand of a target gene usually encodes 
mentary peptide to the target protein/peptide. Anti-sense peptides which are 



=tyleiiientaiy 10 me target protein/peptide are called complementary anti-sense peptides. 
By "target complementary library (TCL)" is meant (i) a peptide library 
containing a collection of complementary peptides of a target protein/peptide, wherein the 
25 complementary peptides in aggregation complement all, substantially all (preferably no less 
than 70%, more preferable no less than 80%, even more preferably no less than 90%), or a 
significant portion of the target protein/peptide; or (ii) a nucleic acid library containing a 
collection of nucieic acids encoding the aforesaid complementary peptides. In a preferred 
embodiment, the complementary peptides are linked to flanking sequences. 
30 By "anti-sense target complementary library (anti-sense TCL)" is meant a 

target complementary library wherein the complementary peptides are also anti-sense 
peptides of the target protein/peptide. These anti-sense peptides in combination 
complement the amino acid sequence of the target protein/peptide in its entirety or 
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substantial entirety (preferably no less than 70%, more preferable no less than 80%, even 
more preferably no less than 90%). In a preferred embodiment, the anti-sense TCL is a 
gene fragment library wherein fragments of the anti-sense strand of the target protein are 
operatively linked to a promoter for expressing the anti-sense peptides. The sizes of the 
5 nucleic acid fragments encoding anti-sense peptides may be defined or random. The anti- 
sense peptides encoded by the gene fragments may overlap each other randomly or in a 
predefined manner. In a further preferred embodiment, fragments of the anti-sense strand 
are linked to flanking sequences. 

By "comprehensive target complementary library (cTCL)" is meant a target 
10 complementary library containing or encoding a collection of complementary peptides. 
Each complementary peptide of the collection complements the entire length of the target 
protein/peptide. In addition, this collection includes all possible complementary peptides to 
~ the target. In other words, each amino acid in the target is complemented by all its possible 

*fl complementary amino acids at the corresponding position respectively in different 

ll l 15 complementary peptides of the collection. For example, the cTCL of the target peptide He- 

-P Gly-Arg would contain a collection of complementary peptides represented by the formula 

i— 3 

jl? X-(Arg, Lys, Asn, Asp, Gin, Glu, or His)-(Gly, Thr, Trp, Ser, Tyr, or Pro)-(Ile, Val, Leu, 

•C Phe, Cys, Met, or Ala)-Y, wherein each position of the complementary peptides is fully 

randomized according to the formula and X and/or Y represent either nothing or one or 
\ y 20 more amino acid residues. This cTCL would have at least 7x6x7 = 294 different members. 

55 By "target protein/peptide" is meant any protein/peptide or a domain within 

G any protein/peptide. 

By "iigand" is meant a peptide or protein capable of binding directly to the 

target protein/peptide. 

25 By "binding domain" or "interaction domain" is meant a domain or region of 

a protein which is primarily responsible for the protein's ability to bind to another protein. 

By "genetic packages" is meant replicable entities capable of carrying the 
nucleic acids of a target complementary library (TCL), including, but not limited to, 
vegetative bacterial cells, bacterial spores, bacterial viruses and eukaryotic cells. Genetic 

30 packages allow the amplification and recovery of the encapsulated genetic message 
following selection. A preferred genetic package is a phage. A phage display library 
expresses a collection of cloned proteins on the surface of phage particles as fusion with a 
phage coat protein. In connection with the genetic packages, "select" or "selection" means 
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enriching a subpopulation of genetic packages displaying a desired phenotypic 
characteristic (such as affinity to a probe). 

Other features and advantages of the invention will be apparent from the 
detailed description of the invention below and from the list of enumerated embodiments 
that follows. 

DETAILED DESCRIPTION OF THE INVENTION 
The molecular recognition theory and complementary hydropathy 

Without being bound by any theory, applicant's invention is supported by 
the molecular recognition theory (MRT) proposed by Blalock et al. in 1984 (11). 

The molecular recognition theory arose from the observation that genetic 
codons for hydrophobic amino acids on the sense strand of a double-stranded DNA are 
complemented by genetic codons for hydrophilic amino acids on the anti-sense strand of the 
same ds DNA, and vice versa (11-13). The DNA sequences in which the amino acids 
encoded by one strand are hydropathically complemented by the amino acids encoded by 
the other strand are referred to as the complementary sequences. Two peptides encoded by 
complementary sequences in the same reading frame show a total interchange of their 
hydrophobic and hydrophilic amino acids when the amino terminus of one is aligned with 
the carboxy terminus of the other. The molecular recognition theory predicts that such 
inverted hydropathic pattern may allow pairs of complementary peptides to assume 
conformations that enable the pair to interact with each other specifically (11). 

This hypothesis was initially tested in the binding of corticotropin (ACTH) 
to its complementary anti-sense peptide (HTCA). Indeed, the synthetic peptide HTCA 
bound to ACTH in a saturable, high affinity, specific fashion (14). Later on, specific 
interactions between pairs of complementary peptides were shown by more than forty 
different proteins (summarized in 15). High affinity anti-sense peptides have been used in 
affinity chromatography to purify a number of biologically active proteins (16-19). 

It follows from the molecular recognition theory that antibodies against a 
complementary anti-sense peptide of a peptide ligand would bind to the receptor of this 
peptide ligand. In fact, scientists have used this approach to identify unknown receptors for 
known ligands, e.g., receptors for angiotensin II, fibronectin, vasopressin, p-endorphin, y- 
endorphin, and enkephalin (24-37). 

Recently, Martins et al. (36) identified the cellular receptor for human prion 
with an anti-sense peptide. They synthesized a complementary anti-sense peptide to the 
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neurotoxic region of the human prion (residues 106-126) and raised antibodies against this 
complementary anti-sense peptide. It turned out that these antibodies specifically stained 
the surface of neurons and recognized a 66-KD membrane protein which binds to prion 
(PrPc) both in vitro and in vivo. Furthermore, both the complementary anti-sense peptide 
and antiserum against it inhibit the toxicity of prion towards neuronal cells (36). 

Consistent with the molecular recognition theory, it has been reported in 
many instances that the complementary sense peptide of a ligand pairs with the binding 
domain of its receptor in the same way it pairs with its complementary anti-sense peptide at 
the DNA level and the protein level (20-23). In that regard, DNA probes derived from the 
coding sequence of peptide Iigands have been used successfully to identify cDN A clones of 
their receptors from a cDNA library (23). 

These examples show that the amino acid sequence of a receptor can be 
deduced from the amino acid sequence of its ligand at the region of their interaction, and 
vice versa, because these two sequences complement each other hydropatftically. It has 
been shown that a pair of idiotype and antiidiotype antibodies (Abl and Ab2) can be 
generated by using as immunogens a pair of complementary sense and anti-sense peptides. 

In a frugal biological system developed under constant evolutionary 
pressure, a peptide or protein that does not interact with another peptide or protein is of 
little value to cells and organisms. The molecular recognition theory suggests that nature 
may have evolved a fail-safe system to ensure that this does not occur. Because of the 
complementarity of nucleic acids and the interactive qualities of the peptides encoded 
thereby, a strand of nucleic acid contains not only the information for one peptide but also 
that for a second peptide which is meant to interact with the first (38). It has been 
postulated that the interacting domains (encoded by the functional exons) of two proteins, 
such as a receptor and its ligand, co-evolved in this manner (38-40). 

In human genes, short complementary sequences (about 15 AA in length on 
average) are frequently found at intervals of approximately fifty amino acids (15, 41). The 
majority of complementary sequences are amphophilic containing reverse turns, implying 
that these regions are on the molecular surface at the beginning or end of automatic folding 
units, helices and beta sheets. This phenomenon implies that these regions are important 
for folding and maintaining the structural integrity of proteins (15). Therefore, 
complementary anti-sense peptides which bind to a target protein would affect the 
biological activities of the target protein. Numerous examples have illustrated that 
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biologically active proteins such as antibodies, cytokines, and hormones can be functionally 
affected by the binding of complementary anti-sense peptides (42-56). . 

According to the molecular recognition theory, peptide ligands for any target 
protein could be identified among the complementary peptides of the target protein, 
5 numerous experimental observation also reveal the specific binding between peptides with 
opposite hydropathy profiles. Therefore, an efficient way of identifying ligands for a target 
is to screen a pool of complementary peptides. 

Sequences of complementary peptides to a target protein/peptide can be 
constructed by using computer programs (e.g., see 15, 70). However, because the "native" 
10 hydropathy of a peptide is not only determined by its sequence, but also strongly influenced 
by the flanking regions, a better TCL is prepared by linking appropriate flanking amino 
acids to complementary peptides (15). 

A relatively simple way of constructing a TCL is to prepare a gene fragment 



*S anti-sense TCL. In this method, the TCL is prepared by amplifying fragments of the target 

_ i 15 gene by polymerase chain reaction (PCR) using random hexamer oligonucleotide primers or 

other universal primers (57). Because sequences encoded by the anti-sense DNA strand of 
the target gene is just as well represented as that of the sense strand, a complete collection 
of the anti-sense peptides with their various flanking sequences are encoded by 
approximately one half of the genetic packages so prepared. Optionally, the fraction 
20 encoding the sense strand peptides could be excluded by subtractive hybridization 
following the generation of a single stranded nucleic acid (e.g., by transcription). 

A cTCL can be synthesized as follows: residues with complementary 
hydropathy scores (as defined in Tables 1-3 and refs. 1 1-13) to each residue in the target 
sequence are randomly incorporated into the corresponding positions in the library. As a 
25 result, the TCL contains all the peptides which complement the target sequence 

hydropathically. This method is particularly suitable for screening for peptide ligands when 
the target is a short region in a protein of known sequence. 

TCL can be displayed on phage as either fusion proteins with the minor 
phage coat protein pin or the fusion proteins with the major phage coat protein pVIII. 
30 Three to five copies of pin fusion proteins are expressed per phage particle, therefore they 
are used to display peptides with, good binding affinities. Because pVIII fusion proteins are 
expressed at a much higher level at about 2,700 copies per particle, they can be used in a 
first screen when the binding affinities are suspected to be weak (1). 
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To select for peptides in the cTCL which have higher affinity to the target 
than the complementary anti-sense peptide, the complementary anti-sense peptide is 
synthesized and included in the elution buffer used to elute phage particles from the 
immobilized target protein/peptide. The concentration of the complementary anti-sense 
peptide determines the stringency of the elution buffer, the higher the concentration, the 
higher the stringency. Phage particles which withstand the high stringency elution carry 
peptides with high affinity to the target protein/peptide. 

Once a binding peptide is identified through an initial screening, the 
sequence of this peptide is used as a guide for the generation of a dedicated library which 
contains various mutants of the initial binding peptide. Peptides with better binding affinity 
than the binding peptide from the initial screening will be selected by panning the dedicated 
library against the immobilized target protein. This procedure is described in detail in U.S. 
patent 5,223,409, incorporated by reference herein in its entirety. 

Peptides with higher affinity can also be generated by linking a peptide 
identified by TCLT to the polymerization domain of a coiled-coil protein (58), such as 
GCN4(59) or COMP(60), to create a tetravalent or pentavalent binding molecule. An 
increase in affinity by ltf-lO 6 folds has been demonstrated with this approach (60). 

Occasionally, high-affinity ligands could not be found from libraries of 
linear and unstructured peptides because a constrained peptide with the right conformation 
confers more thermodynamic advantage than a peptide which changes its shape constantly 
(61). This can be remedied by introducing some structural constraints such as disulfide 
bonds to the peptides, to enhance the chance of finding a high-affinity ligand. A 
constrained TCL can be prepared by carrying PCR reaction with cytosine-containing 
random hexamer oligonucleotide primers in which either cytosine is present at various 
positions in every hexamer oligonucleotide primers, or one additional cytosine is added at 
the 5*- or 3*- end of the random hexamer oligonucleotide sequences. Cystein can be 
synthesized at both ends of the synthetic library. 

Example I: Design an d Construction of a Comprehensive Target Complementary 

Library (cTCU 

^^p 7 ' — Phage fU SED is used as the vector for the cTCL. The library ls madfc by^^ 
>figation of synthetic degenerate Bgl£deoxyjiiigoiw^ I site of 

thefUSE5^and-^^ E, coli with the ligation products by electroporation. The 

^rUSE5 vectui and the cloning procedure is described in detail in ret. / 1, 72 arid 89. The — 
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-gymheric neffgneffa kuRgll ripnxyn1ignniicleo ti rif> * fragments arp yynrhi>oi^ fry p hqg- 
^4)hosphoramidne chemistry iyu), and carry the following Sequence: "=* 

5'-CTGTCAGGGCCCGAGGGGCT(XXX)nGGGGCCGCTGCGGCCTGTCAGG- 
3'(SEQIDNO. 1) 

In the above sequence, n is the number of amino acid residues in the target 
peptide. The degenerate sequence of (XXX)n is designed by the following principle: 
hydrophobic amino acids which are complementary to the hydrophilic amino acids in the 
target peptide are randomly incorporated in the same positions in cTCL; hydrophilic amino 
acids which are complementary to the hydrophobic amino acids in the target peptide are 
randomly incorporated in the same positions in sTCL. Slightly hydrophilic amino acids are 
used in sTCL corresponding to similar residues in the target peptide. The hydrophilic and 
hydrophobic amino acid residues are defined by their hydropathic scores as given in Tables 
1-3. 

To synthesize XXX triplets for hydrophilic amino acids, the second base 
comprises a mixture containing A and G at molar ratio 6:1; the first base consists of 
equimolar mixture of A, G, and C. Thymine is not included in the mixture because its 
presence in the first position would give rise to the stop codon TAA or TAG (91). The 
third base includes only an equimolar mixture of G and C which is designed to favor 
codons used by E. coli to express its most abundant proteins (91). 

To synthesize XXX triplets for hydrophobic amino acids, the second base 
comprises a mixture containing T and C at molar ratio 5:2; the first base consists of mixture 
containing an A:T:C:G molar ratios of 3:3:3:1. The third base again includes only an 
equimolar mixture of G and C. 

To synthesize XXX triplets for slightly hydrophilic amino acids, the first 
base consists of an equimolar mixture of A, T, C and G; the second base contains a mixture 
of G, C, and A molar ration 2:2: 1, and third base includes an equimolar mixture of G and C. 

The oligonucleotides are purified by denaturing PAGE, their complementary 
strands are synthesized by Klenow DNA polymerase (91). They are subsequently digested 
with Bgll and ligated with RJSE5 (7 1 , 72, 89). 

A cTCL carrying peptides with ten amino acid residues will contain 
approximately 2.8x1 0 8 independent clones, which can be completely amplified as a phage 
display library. A cTCL carrying peptides with eleven amino acids will have approximately 
2xl0 9 clones. 
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Example 2: Blocking antigen bindine activity of an antibody with peptide ligands 

targeting the second framework segment (FR2) in the heavy and light chain variable 
regions (VH and VL) 

Variable regions of both the heavy chain and light chain of an 
immunoglobulin molecule are composed of four framework segments (FR1, FR2, FR3 and 
FR4) and three complementarity-determining segments (CDR1, CDR2 and CDR3) with the 
three CDRs separating the four FRs. The residue numbers for these segments were defined 
by Kabat et al. (80). The CDRs are the segments with highly variable amino acid sequences 
and are primarily responsible for antigenic specificity of each antibody. The FRs are the 
relatively conserved segments which provides structural framework for the antigen-binding 
domain of an antibody. The FR2 of light chain constitutes amino acid 35 to 49, and FR2 of 
heavy chain constitutes amino acid 36 to 49 (80). 

The antigen-binding site is formed by the convergence of six hyper- variable 
peptide loops which represent six CDRs of both heavy and light chains (81). The FR2 
segments of each chain also directly contribute to an antigen-binding site. The FR2 
segments of heavy and light chains constitutes the side and base of the antigen-binding 
surface formed by six CDRs, and appear to be an integral part of the site. Parts of the FR2 
seem to be as exposed as the CDR3 loops. In addition, the closest contact points between 
VH and VL domains of Fab are between the FR2 of the heavy chain and the CDR3 of the 
light chain, and vice versa. Therefore, the FR2 plays an important role in keeping the 
structural integrity of Fab since VH and VL need to be closely apposed to each other to 
form a complete antigen-binding site. Consequently, a polypeptide (or a chemical 
compound) which specifically binds to the FR2 segment can disrupt the antigen-binding 
ability of an antibody molecule by (1) altering the topography of an antigen-binding site and 
thereby preventing antigen binding; and/or (2) acting as a structural hindrance to prevent 
close contact between the FR2 and the CDR3s in both chains and thereby disrupting the 
structural integrity of an antigen-binding site. 

Because FR2 is the most conserved segment in the variable region, a peptide 
targeting FR2 may mask antigen-binding activity of antibodies regardless of their antigenic 
specificities. Furthermore, peptides targeting the FR2 segments in antibodies of different 
isotypes can be used to mask antigen-binding activities of all the antibodies of the same 
isotype, i.e., IgE, IgG, IgA, IgM and IgD. 

An FR2 targeting peptide can be identified by the following approaches: 
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Synthesize a TCL based on the sequences of FR2 segments and the 
complementary hydropathy and select peptide ligands by their ability to bind to and disrupt 
the functions of antibodies with different antigenic specificities. The following examples 
illustrate how to identify anti-IgE peptides. 

A very simplified version of this approach is to identify the FR2 
complementary sequences by computer programs (15, 70). Such approach generally 
produce peptides with binding affinity within the range of ltf-lO 6 M" 1 . The affinity can be 
improved by screening a constrained TCL which contains various flanking sequences and 
cysteins at both sides of the complementary peptides (61). 

Example 3: Blocking Human Rhinovirus (HR V) Major Receptor (ICAM- 1 ) with Peptide 
Ligands 

-Woman rninoviruses cause about 70% of common cold. ICAM-1 serves as 
the cellular receptor for majority of HRVs (82). The extracellular part of the ICA^K^^ 
molecule is composed by five immunoglobulin-like domains (D1-D5). Mutational' analysis 
of ICAM- 1 has shown that domain Dl contains the primary bin^irrffsite for rhinoviruses as 
well as the binding site for its natural ligand lymphocyte^cfnction-associated antigen 1 
(LFA-1) (83-86). The regions in Dl which havepeon implicated as the contact sites with 
HRVs include residues 1, 2, 24-29, 40-49^6^70-77 (83-87). Accordingly, peptide ligands 
of ICAM-1 targeting these regions ma^prevent the binding of HRVs to ICAM-1. Suitable 
peptide targets include, but arp-tfot limited to the following: 

Residues 1-5: QrfsVS (SEQ ID NO. 2) 

Residues24<29: SCDQPK (SEQ ID NO. 3) 

ReskWes 40-49: KELLLPGNNR (SEQ ID NO. 4) 

E^kitigs70-77: PDGQSTAk (ShQ lb NO. b) — - 

Peptide ligands capable of binding to each of the above target peptides are 
identified by the following procedure: 

(1) Preparing cTCL (or constrained cTCL) for each target peptide, and displaying 
the library on phage; (2) Immobilizing each target peptide on a solid surface (Alternatively, 
the D1-D2 part of ICAM-1 molecular is expressed and purified in vitro as described in ref 
88, coated onto a solid surface, and used to screen cTCLs); (3) Selecting phage particles by 
panning as described in detail above; and (4) Recovering the peptide sequences from the 
selected phage panicles. 
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Peptide ligands identified by screening cTCLs are synthesized and tested for 
their ability to prevent HRVs infection to Hela cells. The experimental protocols are 
described in US provisional patent application 60/067,1 19 filed December 1, 1997, 
incorporated by reference herein. 

The affinity of the peptides can be improved by linking the peptides with the 
polymerization domain of a coiled-coil protein to create a multivalent binding molecule as 
described in detail in provisional patent application 60/067,1 19. 

Example 4: Identifying anti-IgE peptides with TCLT 

The most common form of allergic reaction is allergic rhinitis (nasal allergy) 
which affects about 20% of population (66). Allergic rhinitis can be either seasonal which 
indicates pollen -allergen sensitivity (also referred to as hay fever), or throughout the year 
(perennial rhinitis) which tend to be caused by house dust, dust mites, and animal danders 
(66). Allergic rhinitis is associated with irritating, sometimes disabling symptoms 
involving upper respiratory tract and eyes, it also imposes morbidity by leading to frequent 
secondary infections and even asthma attacks. 

At cellular level, nasal allergy is caused by inflammatory mediators released 
by the mast cells, such as histamine, serotonin, prostaglandins, and various cytokines. 
Nasal allergy is also associated with basophils which have been presensitized by IgE. 

IgE is the least component of the immunoglobulin family in human and 
animals. Its concentration in normal human serum is very low. Most IgE antibodies bind to 
the surface of mast cells and basophils through the Fc type 1 receptors (FcRI). Binding of 
IgE to the corresponding allergen causes clustering of IgE molecules which in turn activates 
mast cells and basophils and results in the release of inflammatory mediators. 

Many attempts have been made to block the binding of IgE to FcRI as a way 
to prevent allergy. However, none of them have been successful so far in dissociating the 
preformed IgE-FcRI complex because the affinity between IgE and FcRI is as high as any 
good antibody-antigen interaction with the affinity constant (kx) measuring in the range of 
10' 10 M" 1 (67, 68). In addition, trying to prevent allergy by antibodies against other parts of 
the IgE molecule is almost bound to fail because an anti-IgE antibody can cause cross- 
linking of the IgE molecules on cell surface and thereby activate the mast cells and 
basophils and triggers an allergic reaction. 

One approach of preventing allergy is to mask the antigen-binding site of 
IgE with a peptide which only binds to a single IgE molecule and does not cause cross- 
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linking of the IgE molecules. Such peptides target IgE at or near the antigen binding site 
and prevent antigen binding by causing steric hindrance to the binding site. 

It would be preferable that the peptides target the conserved framework 
regions rather than the hypervariable regions in order to affect IgE molecules of a wide 
range of antigenic specificities. 

TCLT is suitable for finding such peptides. First, it is well-established that 
the antigen binding site in an antibody molecule (Abl) can be interfered by anti-idiotype 
antibodies (Ab2) against this antibody (Abl). One type of anti-idiotype antibody, Ab2beta, 
is the direct mirror image of the antibody binding site on Abl . Another type of anti- 
idiotype antibody, Ab2gamma, is directed against an idiotype near the binding site and 
causes steric hindrance to the antigen binding site (62). Second, idiotypes of an antibody 
are composed by peptide sequences found in both the hypervariable regions (CDRs) and the 
conserved framework regions of a antibody molecule (62, 69, 70). Therefore, a peptide 
targeting the conserved region of an idiotype can alter the idiotype and affect the antigen- 
binding ability of an entire class of antibody, such as IgE, IgG and IgA. Kang et al. have 
demonstrated the presence of complementary sequences in human immunoglobulin (52). 

Peptides affecting antigen-binding ability of human IgE molecules can be 
identified with TCLT as described in the following examples. 

Example 5: Blocking antigen-binding ability of an antibody molecule by targeting the 
framework 2 (FR2) region of the antibody 

B ased oir trieir sequences, the hfeavy chain (H) ot an aritibody can be Classitiefl into 
amilies (V H 1 to V H 6). The general sequences of FR2 region in each family are in tysr 
following: ^ 

V H 1: WVR/QQA P/H/T G/A K/Q G/E/R/A L E/G^f^UlG (SEQ ID NO. 6) 

V H 2: WIRQPPGKALEWLA (SEQJBrf^l) 

V H 3: WV/IR/HQAP/QG K GI^/P^I/V WA7L V S/A/G (SEQ ID NO. 8) 

V H 4: WI/VRQPP G KGITe W I G (SEQ ID NO. 9) 

V H 5: W V R Q MJKTk G/E LEWMG (SEQ ID NO. 10) 

V H 6: WI^<3SPSRGLEWLG(SEQIDNO. 11) 

The spqiience of the light chain (L) of an antibody is: 

ag^Jcappa: WYQQKPfl O/TC P/S/A P K I T I V r^F_QJJC^v-4^ — „ 

Human IgE molecules are mostly composed of heavy chain genes from the V H 5, 
V H 3, V H 4 and V H 6 families (92,93), therefore the FR2 sequences from these families plus 
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the FR2 from the light chain serve as the target sequences for an IgE blocking polypeptide 
in this invention. 

The IgE-blocking peptide can be designed directly based on the molecular 
recognition theory as in the following: 

Tep U de l(bimh> t o FR2 in V H bJ: PDALHGPFA Q(or D) L P H PjSEQm 
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Poptidc 2(bind3 to FR2 in V H 3, Vh4 and V H 6). P D A L Q/R OPFA Q/D LPNP 
P ept ide 3(biiidH H u FR2 in V L kappa ChairiJ! PVLLM<PLKG^fc,hLU (ShQ ID 



To isolate peptides with higher affinity to the FR2 region of IgE, a human IgE anti- 
sense TCL or comprehensive TCL library can be screen using the above FR2 target 
sequences. 

An alternative way to generate human IgE blocking polypeptide which targets the 
FR2 region is to screen from a phage display human antibody library. In this case, each 
peptide encoded by the targeted FR2 sequence is synthesized artificially and used for 
panning as described below. 

Example 6: Constructi on of a human IgE anti-sense TCL on bacteria phage 

The human IgE anti-sense TCL is prepared by using coding sequences of the 
cDNA clones of human IgE light chain and heavy chain. The cDNAs of human IgE are 
separated from the vector sequences by agarose gel electrophoresis. Equal amounts of each 
purified inserts are mixed and used as templates for a random priming reaction. 

A uuiu Aiiiid t eJv ^Orifrnfthf* mivpri timipiafA ; s hnii^ W j t h | rT II ,, | |, r ,i„ 
primer No. 1 (5 , -GACGTGGCC^-3 , f N can be A, T, C, or G)joj^-mhTr^^ 
mixed with a reaction mixture containing lOmJ^Jris^^ 7.5), 10 mM MgC12, 1 mM 
dithiothreitol, 0.125 mM each^JPrafialT units of Klenow fragment of DNA 
polymerase and incubotetrat37 °C for 15 min. Then the reaction mixture is boiled and 



20 





cooIedj^iarSnother 2.5 units of Klenow fragment is added, and the reaction mixture is 
rtcuba^^ 

The reaction is stopped by~6oiling and diluted 1:10 withTEbuf 
products are separated from primersusin^M^^ (Amicon) 
with two washesj>£4heT^CaI^ with 200 ml of the TE buffer. After two cycles of 

^pruning, some react i on products will inco rpora te primet sequences at both ends. Tft ey-are- 
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amplified" in PCK using pnmer No. 2 ( 3-UGCCGACGTGGCC-3') (SEQIDNJ 
amplified products are precipitated, purified with_Mi$a»een^ sfil and cloned 

into the sfil site of Xh^jUSE^ of the IgE-TCL follows the 

methods dssgxtbeSTby Smith and Scott (71), the details which are given in the manual 

E pfipri wi t h4hc fUSE oxpros c ion kit by Smith (72) , . 

GACGj j ^^ ID NO. 1 1) and pnmer No. 

-3* (SEQ ID NO. 18) are used to generate a constrained 




Example 7: Selection of the IgE b inding complementary anti-sense peptide by panning 
the human IgE TCL 

Human IgE is diluted in PBS to a concentration of 20 mg/ml and is used to 
coat 3.5 cm wells by incubating for 1 h at 4 °C The remaining binding sites are saturated 
by bovine serum albumin (BS A). A portion of the amplified IgE-TCL is first incubated for 
2 h at 4 °C in a 3.5 cm" well precoated with 1 mg/ml BSA in PBS and 1 raM MnC12. The 
phage unbound to BSA are transferred to a similar well precoated with human IgE. After 
incubation for 1 h at 4 °C, the unbound phage are removed by washing 10 times with PBS 
buffer containing 0.5% Tween 20. The bound phage are eluted with 0.1 M glycine buffer, 
pH^2.2, containing 1 mg/ml BSA and 0.1 mg/ml phenol red. 

The phag CL oi l amplified ui i ing the KQlkan bacteria and partial^ 
precipitation with polyethylene glycol (^jniej^ rounds. 
Sequences carried byjhejeleetetf^ then determined using the Sequenase kit 
(United^tesHBlo^emical) with the primer 5 * -CCCTC ATAGTTA AGCGTA ACG-3 ' (73) 

Example 8: Characterizing anti-IgE activities of the complementary anti-sense peptides 

The peptides selected from the previous step are synthesized and 
individually tested in the following assays. 

Assa Y 1: Histamine release bv passively sensitized basop hils. 

The procedure is described in detail previously (74). Briefly, reaginic sera 
are obtained from allergic patients who are allergic to ragweed, rye grass, Chinese elm or 
other preidentified allergens. Peripheral blood mononuclear cells are isolated from normal 
individuals by differential centrifugation on Ficoll-Pague (Pharmacia). Aliquots of a cell 
suspension are incubated with 2- to 10- fold dilution of a reaginic serum, Tris A buffer 



- 16- 



WO 99/55911 PCTAJS99/06537 

containing 0.025 M Tris, 0.12 M NaCl, 0.005 M KC1, and 0.05% human serum albumin 
(pH 7.5) at 37 °C for 90 min. 

The cells are then washed three times with Tris A buffer and resuspended in 
Tris- ACM buffer (pH 7.6) which consists of Tris A buffer plus 0.6 mM CaC12 and 0.01 
mM MgC12. They are incubated with peptides at various concentrations for 30 min at 37 
°C and then are challenged with either purified ragweed antigen E (0. 1 mg/ml) or crude 
ragweed antigen, crude Chinese elm extracts, and rye grass extracts (Hollister-Stier, 
Spokane, WA). 

After 40-min incubation at 37 °C, histamine content in supernatants is 
measured by the method of Siraganian (75). The peptides which can prevent histamine 
release triggered by all allergens will be selected. 

Assay 2: Prausnitz-KDstner reactions 

The procedure is carried out as described previously (74, 75). The reactions 
are done in a single healthy individual whose serum IgE concentration is at low level 
(around 10 ng/ml). To determine the abilities to block passive sensitization, 20 ml of 
various concentrations of the peptides is injected intracutaneously into skin sites 1 hr before 
sensitization of the skin sites with 20 ml of a 50-fold dilution of allergic serum (IgE at 912 
ng/ml). 

In each experiment, control site receives only the diluting buffer before the 
sensitization. All skin sites are challenged 48 hr after the passive sensitization with allergen 
extract, such as crude ragweed antigen. The size of wheat and erythema reactions are 
measured 20 min later by using transparent tape to transfer the outline of the reactions to a 
paper and expressed by the weight of the paper (76). It has been demonstrated that 
unsensitized skin sites do not show any wheat or flare reaction upon antigen challenge. 
Peptides which can prevent or lessen the reactions will be selected. 
Example 9: Measuring affinity of the IgE-binding peptides and improvement of the 
peptides 

The binding of peptides to human IgE can be analyzed by BIAcore biosensor 
(Pharmacia Biosensor AB, Uppsala, Sweden) (77, see 78 for review). Experiments are 
carried out at 25 °C in 10 mM HEPES buffered saline (HBS) with 150 mM NaCl, 3.4 mM 
EDTA and 0.005% Surfactant P20 (Pharmacia). 

Human IgE is coupled to the sensor chip by activating the 
carboxymethylated dextran matrix with 0.2 M N-ethyI-N*-(3-diethylaminopropyl)- 
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carbodiimide and 0.05 M N-hydroxysuccinimide (NHS) (Pharmacia Amine Coupling Kit), 
followed by addition of the human IgE (80 mg/ml, 10 mM sodium acetate, pH 4.5). 
Remaining NHS-esters are blocked with a pulse of 1.0 M ethanolamine hydrochloride, pH 
8.5. 

To monitor the binding of peptides to IgE, various concentrations of peptide 
are injected over the immobilized IgE using conditions where binding is limited by kinetic 
parameters. Under these conditions, the solution association constant (KA) for the IgE- 
peptide interaction can be calculated using the following equation: 

[SL]=KA[S]t[L]/(l+KA[L]) (79) 
where [SL] is the concentration of the IgE-peptide complex at equilibrium, [L] is the 
concentration of the free peptide, and [S]t is the total concentration of IgE. 

Because the peptides selected in the present invention prevent the binding of 
allergens to IgE by steric hindrance rather than directly competing with the allergens for 
binding sites on IgE, peptides with apparent affinity comparable or even lower than the 
affinity between IgE and allergens can be effective in preventing the binding of allergens as 
long as enough concentration of the peptides is maintained locally so that they remain 
bound to IgE when the presensitized mast cells or basophils encounter the allergens. 
Nevertheless, if peptides with higher affinity are desired, an in vitro affinity maturation 
process can be performed using the TCLT-selected peptides as templates. The details of 
this approach are described in U.S. patent 5,223,409, incorporated by reference herein in its 
entirety. 

The human IgE FR2 segment binding peptides can also be selected by 
constructing a cTCL against the IgE FR2 segment, and screening the cTCL with a synthetic 
peptide containing the IgE FR2 sequence. The screening procedure and characterization of 
the selected peptides are carried out as described above. 
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